The 1997 Bbn Byblos System Applied to Broadcast News Transcription

نویسندگان

  • Francis Kubala
  • Jason Davenport
  • Hubert Jin
  • Daben Liu
  • Tim Leek
  • Spyros Matsoukas
  • David Miller
  • Long Nguyen
  • Fred Richardson
  • Richard Schwartz
  • John Makhoul
چکیده

In this paper, we describe the BBN Byblos system used for the 1997 DARPA Hub-4 Broadcast News evaluation and discuss numerous improvements made to the system in 1997. We focused our e ort entirely upon the two conditions containing studio-quality uncorrupted speech from native speakers, the so-called F0 (prepared speech) and F1 (spontaneous speech) conditions. In particular, we did not bother to create a separate acoustic model for narrow-band telephone speech. Our overall 1997 Hub-4 evaluation result was 20.4% WER, but our error rate on the F0/F1 conditions was only 14%. We ran regression tests on development test data that show we reduced word error rate by 22-30% on the F0/F1 conditions compared to our 1996 system. Sizable gains were achieved on all the other conditions as well, even though no extra e ort was spent toward improving them. Brief summaries of three related e orts are also given covering the use of Byblos for Spanish news transcription, near real-time transcription, and automatic extraction of named entities from broadcast news.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The 1998 BBN BYBLOS Primary System applied to English and Spanish Broadcast News Transcription

In this paper, we describe the BBN BYBLOS system used for the 1998 Hub-4E primary and Hub-4Sp evaluation benchmarks, and discuss the improvements made to the system in 1998. We focus on the techniques that were new in this year’s system, including processing of the acoustic training data, test segmentation, revised cepstral normalization and Vocal Tract Length Normalization (VTLN), band-specifi...

متن کامل

Toward realtime transcription of broadcast news

In this paper, we describe our recent work in fast automatic transcription of broadcast news programming from radio and television. Given our state-of-the-art BBN BYBLOS primary system [1] running at 230 times real time (230xRT) we show that eliminating and approximating many computationally expensive components speeds up the system by a factor of more than 20 without significant loss in recogn...

متن کامل

The 1999 BBN BYBLOS 10xRT Broadcast News Transcription System

In this paper, we describe the BBN BYBLOS system used for the 1999 Hub-4E 10xRT evaluation benchmark, and discuss the improvements made to the system in 1999. We focus on the techniques that were new in this year’s system to achieve an optimal tradeoff between accuracy and speed for the evaluation benchmark test. Overall, we improved the recognition accuracy on the 1998 Hub-4E evaluation test b...

متن کامل

Broadcast news transcription

In this paper we describe our recent work on automatic transcription of radio and television news broadcasts. This problem is very challenging for large vocabulary speech recognition because of the frequent and unpredictable changes that occur in speaker, speaking style, topic, channel, and background conditions. Faced with such a problem, there is a strong tendency to try to carve the input in...

متن کامل

Japanese broadcast news transcription

In this paper, we describe the on-going development of a Japanese Broadcast News Transcription system at BBN Technologies. This is a collaboration between BBN and NHK to use automatic speech recognition technology to provide live closed caption for NHK’s TV news programs in Japan. We describe what the NHK Broadcast News Corpus comprises and how we adopted transcription technology developed for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998